Speech coding based on adaptive mel-cepstral analysis
نویسندگان
چکیده
In this paper, we propose an ADPCM coder which uses a backward adaptive predictor based on the adap tive mel-cepstral analysis. The spectrum represented by the mel-cepstral coefficients has frequency resolution similar to that of the human ear which has high resolution at low frequencies. In the coder, since the transfer functions of noise shaping and postflltering are also defined through the mel-cepstral coeflicients, the effects of nose shaping and post6ltering should fit with characteristics of the human auditory sensation. We incorporate a pitch predictor into the ADPCM coder, and evaluate the speech quality based on objective and subjective performance tests. It is shown that the coder at 16 kb/s can produce a high quality speech comparable with that of the CCITT G.721 ADPCM coder at 32kb/s with no algorithmic delay.
منابع مشابه
A 16 kb/s Wideband CELP-Based Speech Coder Using Mel-Generalized Cepstral Analysis
We propose a wideband CELP-type speech coder at 16 kb/s based on a mel-generalized cepstral (MGC) analysis technique. MGC analysis makes it possible to obtain a more accurate representation of spectral zeros compared to linear predictive (LP) analysis and take a perceptual frequency scale into account. A major advantage of the proposed coder is that the benefits of MGC representation of speech ...
متن کاملPerformance Evaluation of Blind Equalization for Mel-LPC based Speech Recognition under Different Noisy Conditions
This study is aimed to develop a noise robust distributed speech recognizer (DSR) for real-world applications by employing Blind Equalization (BEQ) for robust feature extraction. The main focus of the work is to cope with different noisy environments in recognition phase. To realize this objective, Mel-LP based speech analysis has been used in speech coding on the linear frequency scale by appl...
متن کاملA Comparative Performance of Various Speech Analysis-Synthesis Techniques
In this paper, we present a comparative performance of the various analysis-synthesis techniques which separate the acoustic parameters and allow the reconstruction of the speech signal, which is very close to original speech. The analysis-synthesis of speech signal is used for speech enhancement, speech coding, speech synthesis, speech modification and voice conversion. Our comparative study i...
متن کاملPerformance Evaluation of CMN for Mel-LPC based Speech Recognition in Different Noisy Environments
This study is intended to develop a noise robust distributed speech recognizer for real-world applications by employing Cepstral Mean Normalization (CMN) for robust feature extraction. The main focus of the work is to cope with different noisy environments. To realize this objective, Mel-LP based speech analysis has been used in speech coding on the linear frequency scale by applying a first-or...
متن کاملCepstral Features and Text-Dependent Speaker Identification – A Comparative Study
In the study, the effectiveness of combinations of cepstral features, channel compensation techniques, and different local distances in the Dynamic Time Warping (DTW) algorithm is experimentally evaluated in the text-dependent speaker identification task. The training and the testing has been done with noisy telephone speech (short phrases in Bulgarian with length of about 2 seconds) selected f...
متن کامل